Chapter 1 Unstructured Environmental Audio : Representation , Classification and Modeling
نویسندگان
چکیده
Unstructured audio is an important aspect in building systems that are capable of understanding their surrounding environment through the use of audio and other modalities of information, i.e. visual, sonar, global positioning, etc. Consider, for example, applications in robotic navigation, assistive robotics, and other mobile device-based services, where context aware processing is often desired. Human beings utilize both vision and hearing to navigate and respond to their surroundings, a capability still quite limited in machine processing. The first step toward achieving recognition of multi-modality is the ability to process unstructured audio and recognize audio scenes (or environments). abStract
منابع مشابه
A Joint Semantic Vector Representation Model for Text Clustering and Classification
Text clustering and classification are two main tasks of text mining. Feature selection plays the key role in the quality of the clustering and classification results. Although word-based features such as term frequency-inverse document frequency (TF-IDF) vectors have been widely used in different applications, their shortcoming in capturing semantic concepts of text motivated researches to use...
متن کاملUnstructured Audio Classification for Environment Recognition
My thesis aims to contribute towards building autonomous agents that are able to understand their surrounding environment through the use of both audio and visual information. To capture a more complete description of a scene, the fusion of audio and visual information can be advantageous in enhancing the system’s context awareness. The goal of this work is on the characterization of unstructur...
متن کاملImplementation and Performance Evaluation of Acoustic Denoising Algorithms for UAV
.................................................................................................................................. iii ACKNOWLEDGMENTS ............................................................................................................. iv LIST OF TABLES .........................................................................................................................
متن کاملLatent acoustic topic models for unstructured audio classification
Samuel Kim, Panayiotis Georgiou and Shrikanth Narayanan APSIPA Transactions on Signal and Information Processing / Volume 1 / December 2012 / e6 DOI: 10.1017/ATSIP.2012.7, Published online: 10 December 2012 Link to this article: http://journals.cambridge.org/abstract_S2048770312000078 How to cite this article: Samuel Kim, Panayiotis Georgiou and Shrikanth Narayanan (2012). Latent acoustic topic...
متن کاملImage Classification via Sparse Representation and Subspace Alignment
Image representation is a crucial problem in image processing where there exist many low-level representations of image, i.e., SIFT, HOG and so on. But there is a missing link across low-level and high-level semantic representations. In fact, traditional machine learning approaches, e.g., non-negative matrix factorization, sparse representation and principle component analysis are employed to d...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016